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ABSTRACT 

Androgen receptor (AR) plays an important regula- 
tory role in prostate cancer. AR's transcriptional ac- 
tivity is regulated by androgenic ligands, but also by 
post-translational modifications, such as SUMOyla- 
tion. To study the role of AR SUMOylation in genuine 
chromatin environment, we compared androgen- 
regulated gene expression and AR chromatin occu- 
pancy in PC-3 prostate cancer cell lines stably ex- 
pressing wild-type (wt) or doubly SUMOylation site- 
mutated AR (AR-K386R,K520R). Our genome-wide 
gene expression analyses reveal that the SUMOy- 
lation modulates the AR function in a target gene 
and pathway selective manner. The transcripts that 
are differentially regulated by androgen and SUMOy- 
lation are linked to cellular movement, cell death, 
cellular proliferation, cellular development and cell 
cycle. Fittingly, SUMOylation mutant AR cells pro- 
liferate faster and are more sensitive to apoptosis. 
Moreover, ChlP-seq analyses show that the SUMOy- 
lation can modulate the chromatin occupancy of AR 
on many loci in a fashion that parallels their differ- 
ential androgen-regulated expression. De novo motif 
analyses reveal that F0XA1, C/EBP and AP-1 motifs 
are differentially enriched at the wtAR- and the AR- 
K386R,K520R-preferred genomic binding positions. 
Taken together, our data indicate that SUMOylation 
does not simply repress the AR activity, but it reg- 
ulates AR's interaction with the chromatin and the 
receptor's target gene selection. 

INTRODUCTION 

Prostate cancer is a major health concern among men by 
being one of the most common cancers diagnosed and one 
of the most common causes of cancer death [(1), http:// 
www.cancerresearchuk.org/]. The androgen receptor (AR) 



has an important role in the development and progres- 
sion of prostate cancer and regardless of the progress in 
prostate cancer pathobiology, the AR remains the ma- 
jor druggable target for the advanced disease. AR is an 
androgen-regulated transcription factor which in prostate 
is activated by the binding of 5ct-dihydrotestosterone. Sub- 
sequently, the AR moves to nucleus, binds to specific an- 
drogen response elements (AREs) on the regulatory regions 
of its target genes and in this way conveys the message of 
androgens directly to the level of gene programs (2-4). In 
addition to hormone binding, the AR activity is regulated 
by post-translational modifications, including SUMOyla- 
tion (5). SUMOylation is a reversible modification in which 
small ubiquitin-related modifier protein (SUMO) 1, 2 or 
3 is covalently attached to target proteins' specific lysine 
residues via an enzymatic E1^E2^E3 pathway analo- 
gous to ubiquitylation but with enzymes (El, SAEl/2; E2, 
UBC9; E3, e.g. PIAS proteins) distinct from the ubiquity- 
lation (6,7). The SUMOylation pathway does not gener- 
ally target proteins for degradation, but regulates proteins' 
activity and changes their interactions with other protein 
and/or subcellular or nuclear localization (6,8,9). 

Previous studies have shown that the N -terminal trans- 
activation domain of AR is covalently modified by SUMOs 
at two conserved lysine (in human sequence K386 and 
K520) residues in an androgen-inducible and reversible 
fashion (5,10). Moreover, SUMOylation pathway compo- 
nents act as AR coregulators in transcription assays (10- 
12). Disruption of these sites increases the transcriptional 
activity of AR on compound ARE-driven promoters in 
reporter gene assays, suggesting that the modification is 
linked to transcriptional repression. However, very little is 
known about the importance and role of the AR SUMOy- 
lation in a genuine chromatin environment and regulation 
of endogenous AR target genes in prostate cancer cells. To 
study in a systematic genome-wide fashion the role of AR 
SUMOylation in prostate cancer chromatin environment, 
we used PC-3 cell lines that stably express wild-type (wt) 
or SUMOylation-deficient AR (AR-K386R,K520R; AR- 
2KR) and analyzed their androgen-regulated transcripts. 
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We also compared the capabilities of these two AR forms 
to bind to chromatin by using chromatin immunoprecipita- 
tion coupled to deep sequencing (ChlP-seq). These genome- 
wide analyses that were additionally carried out in HEK293 
cells stably expressing wtAR or AR-2KR revealed that the 
AR SUMOylation sites do not simply repress the AR ac- 
tivity on all target genes. The mutant also exhibited atten- 
uated transcriptional activity on several genes and a group 
of target genes were insensitive to the SUMOylation. Inter- 
estingly, the genes differently expressed by androgen due to 
the AR SUMOylation sites are significantly enriched in cell 
proliferation and apoptosis pathways. Our cistrome analy- 
ses also show that the SUMOylation can regulate the recep- 
tor s chromatin occupancy in a locus-selective fashion. 

MATERIALS AND METHODS 

Cell culture 

Stably AR-expressing PC-3 prostate cancer cells were main- 
tained and generated as described in reference (13). Stably 
AR-expressing isogenic Flp-In-293 (HEK293) (Invitrogen) 
were maintained and generated as described in reference 
(14). In experiments steroid-depleted transfection medium 
was used (5% charcoal-stripped-FBS in F-12 for PC-3 cells 
and 2.5% charcoal-stripped-FBS in Dulbecco's modified 
Eagle's medium (DMEM) for HEK293). In both cell types, 
several clones were analyzed before continuing studies with 
one representative clone. 

RT-qPCR analysis 

PC-3 and HEK293 cells were seeded onto 6-well plates, 
grown 48 h in steroid-depleted transfection medium and 
then exposed to vehicle (EtOH) or 10 nM R1881 for 16 h 
(PC-3) or 24 h (HEK293). Total RNA of biological tripli- 
cates was extracted (TriPure isolation reagent, Roche) and 
converted to cDNA (Transcriptor First Strand cDNA syn- 
thesis Kit, Roche) according to manufacturer's instructions. 
Expression of AR target genes with specific primers (Sup- 
plementary Table SI) was measured by RT-qPCR as de- 
scribed in reference (15) using RPL13A (for PC-3) and 
GAPDH (for HEK293) mRNA levels to normalize the 
amounts of total RNA between the samples. 

Microarray analysis 

Total RNA of biological triplicates was collected, and hy- 
bridized to Illumina HumanHT-12 v3 (for HEK293 cells) 
or v4 (for PC-3 cells) Expression BeadChips (San Diego, 
CA, USA) at the Finnish Microarray and Sequencing Cen- 
tre (Turku, Finland) using protocols recommended by the 
manufacturer. The Illumina BeadChIP data were analyzed 
using the Bioconductor associated packages (16). Data were 
preprocessed (bgAdjust), variance stabilizing transformed 
(vst) and robust spline normalized (rsn) with lumi package 
(17) and analyzed using the Linear Models for Microarray 
Data (limma) package (18) (empirical Bayes statistics with 
a Benjamini and Hochberg multiple test correction proce- 
dure). A gene was considered significantly changed, if it 
had the adjusted P-value <0.05 and fold change <0.7 or 
> 1 .4. Heat maps were generated by using heatmap.2 in the 



R package gplots. Ingenuity Pathway Analysis® (IPA) was 
used to identify biological processes enriched for differen- 
tially expressed genes. First, a core analysis was performed 
with two distinct lists (androgen-regulated genes in wtAR 
or AR-2KR expressing cells). The results were then com- 
pared to identify any distinct biological processes that were 
differentially regulated. 

Cell proliferation and apoptosis assay 

In proliferation assay, PC-3 cells were seeded in 96-wells 
and cells were grown 24 h before addition of 10 nM R1881 
or vehicle (EtOH) and the amount of viable cells was mea- 
sured using CellTiter-Glo® Luminescent Cell Viability As- 
say (Promega) according to the manufacturer's instructions. 
Similarly, 48 h and 72 h after exposure viable cells were 
measured. The results were calculated as percentage of cell 
growth using the first measurement as reference point. In 
apoptosis assay, cells were cultured in 96-wells and grown 24 
h before treating with 1.5 jjlM camptothecin for 20 h. Apop- 
tosis was measured using Cell Death Detection ELISA'''"*^ 
kit (Roche) according to the manufacturer's instructions. 

Chip coupled to high-throughput sequencing (ChlP-seq) 

The chromatin immunoprecipitation (ChIP) experiments 
and analyses were performed as previously described (15). 
Briefly, PC-3 (13) or HEK293 cells (14) were seeded onto 
10-cm plates, grown 72 h in steroid-depleted transfection 
medium and exposed to vehicle (EtOH) or 10 nM R1881 
for 1 h (PC-3) or 40 min (HEK293) before crosslink- 
ing with 1% (v/v) formaldehyde and harvesting for im- 
munoprecipitation. ChlP-seq samples were immunoprecip- 
itated with anti-AR antibody (19) and for quantitative PCR 
(qPCR)-based ChIP assays, samples were immunoprecip- 
itated with anti-FOXAl (Abeam, ab23738), anti-c-Jun 
(Santa Cruz Biotechnology, sc-1694), anti-C/EBP|3 (Santa 
Cruz Biotechnology, sc-150) or rabbit-IgG (Santa Cruz 
Biotechnology, sc-2027). Specific primers used in qPCR 
analysis are listed in Supplementary Table SI. Single-end 
50 bp ChlP-seq data were generated using Illumina Hiseq 
System (Illumina) in the EMBL Genomics Core Facility 
(Heidelberg, Germany). The quality of raw reads was ana- 
lyzed by FastQC (http://www.bioinformatics.babraham.ac. 
uk/projects/fastqc/) and after that FASTX-toolkit (http:// 
hannonlab.cshl.edu/fastx_toolkit/) was used for preprocess- 
ing (removal of artifacts, 50-bp trimming, quality filter- 
ing and collapsing identical reads). Remaining reads were 
aligned against the human reference genome version hgl9 
with Bowtie software 0.12.7 (20), allowing one mismatch 
and accepting only the best alignment (with essential com- 
mand line arguments: -v 1 -k 1 -m 1 -f -S — best hgl9). 
Resulting Sequence Alignment/Map (SAM) files were an- 
alyzed using HOMER software version 4.2 (21). Enriched 
binding sites were detected using findPeaks command with 
factor analysis strategy using 3-fold enrichment over in- 
put background read set (own input for each cell line) 
and finally selecting binding sites for which the normal- 
ized tag counts were >6. Matching the ENCODE Con- 
sortium guidelines, we sequenced two independent biolog- 
ical replicates per sample and gained >10 million uniquely 
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mapped reads per replicate (22). Binding site overlaps be- 
tween each replicate pair, identified using BEDTools (23), 
also fulfilled the ENCODE requirements for replicate sim- 
ilarity. Initial overlap analysis of wtAR- or AR-2KR- ex- 
pressing cell binding sites was performed using BEDTools 
and non-shared binding sites were analyzed further using 
getDifferentialPeaks command of HOMER using a cutoff 
of 2-fold enrichment over the other sample to find final 
preferred binding sites for wtAR- and AR-2KR-expressing 
cells. HOMER software was also used to format the data 
for visualization in Integrative Genomics Viewer (IGV), ge- 
nomic annotation of peak positions and for the de novo 
motif analysis. De novo discovery of the motifs (length 
6,10,15,20) was performed on sequences covering ±100 bp 
of the AR peak center. Association of AR binding sites to 
nearby genes (±100 kb from transcription start site) and 
androgen-regulated genes was done with Anduril (Engine 
1.2.18) (24) with NextBioentity (Ensembl Homo Sapiens 
71.37) (Supplementary file 1). 

Accession numbers 

Bead array and ChlP-seq data are submitted to the NCBI 
Gene Expression Omnibus database (25) (http;//www.ncbi. 
nlm.nih.gov/geo/) and are accessible through GEO Series 
accession number GSE54202. 

RESULTS 

Androgen target genes are differently modulated by AR 
SUMOylation 

To study the role of the AR SUMOylation in genuine 
chromatin environment, we constructed PC-3 prostate can- 
cer cell lines stably expressing wild-type AR (wtAR) or 
SUMOylation deficient AR (AR-2KR) in which SUMO 
acceptor sites K386 and K520 were converted to arginine 
residues (Supplementary Figure SI A), (13). Immunoblot 
analysis of the cells confirmed that the wtAR and the AR- 
2KR are equally expressed and that their expression lev- 
els do not exceed that of endogenous AR in LNCaP cells 
and their subcellular distributions are comparable (Supple- 
mentary Figure SIB and C). Based on transfected reporter 
gene activity in the PC-3 cells, mutation of the SUMOy- 
lation sites increases the transcriptional activity of AR on 
compound ARE-driven promoters as shown previously for 
cells of nonprostatic origin (Supplementary Figure SID), 
(5). Moreover, our recent preliminary AR target gene anal- 
yses of isogenic HEK293 cell lines expressing wtAR or AR- 
2KR (at a level comparable to that of endogenous AR in 
VCaP prostate cancer cells) suggested that the effects of 
SUMOylation on the AR activity are target gene-dependent 
(14). We next compared gene expression profiles of the 
WtAR and AR-2KR PC-3 cells exposed to synthetic andro- 
gen (R 1881) or vehicle by using Illumina HT-12 v4 Expres- 
sion BeadChips. Comparison of our microarray data with 
those from VCaP cells showed a reasonable, ~30% overlap 
of androgen-regulated genes (Supplementary Figure S2). 
The genome-wide analysis indicated that more genes were 
androgen-regulated in the wtAR cells (473) than in the AR- 
2KR cells (143), with 47% and 61% of these genes being 
up-regulated by androgen in the wtAR and AR2-KR cells. 



respectively, (Figure lA, Supplementary file 2). The differ- 
ence between wtAR and AR-2KR remains also if other P- 
value cut offs are applied (Supplementary Table S2). This 
was also seen in genome-wide gene expression analysis of 
the HEK293 wtAR and AR-2KR cells (368 versus 178 reg- 
ulated genes; 49% versus 57% up-regulated by androgen) 
(Supplementary Figure S3A, Supplementary file 2). More 
interestingly, as shown by the Venn diagrams in Figure 
lA, only a small portion of androgen-regulated genes were 
common to both cells. Unsupervised hierarchical cluster- 
ing of the androgen-regulated genes in the PC-3 and the 
HEK293 cell lines further emphasized the differences in the 
androgen-regulated gene expression between the wtAR and 
the AR-2KR cells (Figure IB, Supplementary Figure S3B). 
Expression of one gene from each cluster in PC-3 cells was 
validated by RT-qPCR analysis (Figure IC). These analyses 
show that there are different classes of AR target genes re- 
garding their sensitivity to the SUMOylation. Some genes, 
such as FKBP5, are insensitive to the AR SUMOylation 
sites, being similarly expressed in both cell lines. A subset 
of genes, such as CLDN8, require the SUMOylation sites, 
as they are robustly androgen-regulated only in the wtAR 
cells, whereas certain genes, such as ADRAIB, are con- 
trastingly inhibited by the SUMOylation, as demonstrated 
by their enhanced expression in the AR-2KR cells. Simi- 
lar AR target gene classes were seen and validated in the 
HEK293 cells (Supplementary Figure S3B and C). Thus, 
our unbiased genome-wide expression data indicate that the 
SUMOylation modulates the AR function in a target gene- 
selective fashion instead of simply repressing the AR activ- 
ity, as previously suggested based on simple ARE-driven re- 
porter gene assays (5,10). 

AR SUMOylation regulates apoptosis sensitivity and cell 
proliferation of prostate cancer cells 

The androgen-regulated genes of both wtAR and AR-2KR 
cells were next subjected to IPA to identify enriched molec- 
ular and cellular functions and pathways regulated by the 
AR SUMOylation. The androgen-regulated genes in the 
AR-2KR PC-3 cells were enriched for genes associated with 
such biological functions as cellular movement, cellular de- 
velopment, cell death and proliferation (Figure 2A). Gener- 
ally, 13-37% of androgen-regulated genes in the AR-2KR 
cells were linked to these functions, whereas only 5-21% 
of the genes in the wtAR cells were linked to these func- 
tions. Many of the most differentially expressed genes due 
to AR mutation and androgen exposure were linked to sev- 
eral of these functions (Supplementary Figure S4). Exam- 
ples of such genes include RASDl (encoding a Ras-related 
protein that is stimulated by dexamethasone), EFEMPl 
(encoding a member of the fibulin family of extracellular 
matrix glycoproteins), LRIGI (encoding a transmembrane 
protein that interacts with epidermal growth factor recep- 
tor tyrosine kinases), and DJJSPl (encoding dual specificity 
protein phosphatase 1). In contrast, the androgen-regulated 
genes in the wtAR PC-3 cells showed enrichment in various 
biosynthesis-related canonical pathways, including andro- 
gen biosynthesis (Figure 2B), highlighting the differences in 
transcriptional regulation by wtAR and AR-2KR upon an- 
drogen stimulation. 
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Figure 1. SUMOylation sites modulate the AR activity in a target gene-selective fashion. Stably wtAR- and AR-2KR-expressing PC-3 prostate cancer cells 
were treated with R1881 (10 nM) or vehicle (ethanol, EtOH) for 16 h and isolated RNAs were analyzed by Illumina Expression BeadChips as described 
in 'Materials and Methods' section. The genes that had adjusted /"-value < 0.05 and fold-induction >1.4 or <0.7 were considered as androgen-regulated 
genes. (A) Venn diagrams showing androgen-up- and androgen-down-regulated genes in the wtAR- and the AR-2KR-expressing cells. (B) Heat map of 
androgen-regulated genes clustered by unsupervised hierarchical clustering. (C) RT-qPCR validation of one AR target gene of each cluster. The relative 
mRNA expressions represent the means (n = 3) ± SDs. Statistically significant differences (***/> < 0.001, **P < 0.01 and *P < 0.05; Student's f-test) 
between the wtAR and the AR-2KR after androgen exposure are indicated. 



Since the wtAR and the AR-2KR cells showed significant 
differences in their androgen-regulated expression of genes 
governing cellular proliferation and survival, we compared 
their growth rates and apoptosis sensitivities. Although an- 
drogen exposure increased the growth of both wtAR and 
AR-2KR cells, the AR-2KR-expressing PC-3 cells grew sig- 
nificantly faster than their wtAR counterpart cells (Figure 
3A). Similarly, the SUMOylation-deficient AR expressing 
HEK293 cells grew faster than the wtAR-expressing cells 
(Supplementary Figure S5). These results are in line with 
our gene expression data showing that androgen-regulated 
genes, e.g. RASDl and LRIGl. known to decrease cell pro- 
liferation were more robustly up-regulated in the wtAR ex- 
pressing cells and that genes, such as EFEMPl, DUSPl, 
ADRAIB (encoding alpha- IB-adrenergic receptor that can 
function as a protooncogene), reported to increase prolif- 
eration were up-regulated in the AR-2KR expressing cells 
(Figure 3B, Supplementary file 2), (26-30). To assess the 
apoptosis sensitivity, we exposed our PC-3 cells to an apop- 
tosis inducer camptothecin and monitored their cell death. 
The apoptosis assays showed that the AR-2KR cells are 
more sensitive to camptothecin-induced apoptosis than the 
WtAR cells (Figure 3C). 

Collectively these results strongly suggest that the molec- 
ular and cellular functions identified with the IPA are bi- 



ologically relevant and the SUMOylation is regulating the 
AR function in a gene pathway selective fashion. 

Effect of SUMOylation on AR cistrome 

To complement the AR target gene expression analyses, 
we next investigated whether SUMOylation influences the 
AR chromatin occupancy by identifying the genomic wtAR 
and AR-2KR binding sites using chromatin immunopre- 
cipitation coupled to deep sequencing (ChlP-seq) analyses. 
Our biological replicates showed excellent concordance, as 
judged by tag density comparisons (Supplementary Figure 
S6). The ChlP-seq analyses revealed 19 400 high confidence 
AR binding sites (ARBs) in androgen exposed wtAR PC-3 
cells and 23 059 ARBs in SUMOylation deficient AR cells. 
Based on initial overlap analysis shown in Figure 4A, most 
of these ARBs overlapped. Moreover, the tag density maps 
and average tag profiles of the wtAR and the AR-2KR oc- 
cupancy showed that the two receptors were loaded to the 
same extent to them (Figure 4A and B). Further analysis of 
non-shared binding sites lead to identification of two other 
ARB groups, wtAR-pref erred and AR-2KR-preferred sites, 
showing at least 2-fold difference between the occupancy of 
the WtAR and that of the AR-2KR. Based on these analy- 
ses, the mutation of the AR SUMOylation sites resulted in 
2675 AR-2KR-preferred ARBs and 1561 wtAR-preferred 
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Figure 2. Androgen-regulated genes in the wtAR- and the AR-2KR-expressing PC-3 cells are partly enriched in different biological functions. Androgen- 
regulated genes from both cell types were subjected to Ingenuity Pathway Analysis® (IPA). Threshold for a gene list to be significantly involved in a 
particular biological function was /"-value < 0.05 (or -logio (B-H adjusted /"-value) > 1.30). (A) Comparative analysis showing the five most significantly 
enriched inolecular and cellular functions. Percentage and number of genes in each category overlapping the set of androgen-regulated genes are shown. 
(B) Heat map of hierarchically clustered biosynthesis-related canonical pathways. 
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Figure 3. Mutation of AR SUMOylation sites promotes cell proliferation and influences apoptosis sensitivity. (A) Amount of viable cells after vehicle 
or R1881 treatment was measured 48 h and 72 h after exposure using CellTiter-Glo® Luminescent Cell Viability Assay (Promega). Growth percentages 
represent the means {n = 5) ±SDs. (B) RT-qPCR analysis of examples of genes that were differentially expressed between the cell lines and previously shown 
to have a role in cell proliferation. (C) Cellular apoptosis, measured using Cell Death Detection ELISA kit (Roche) after 20 h exposure to camptothecin 
(1.5 (jlM), and expressed as the mean relative level of apoptosis (±SD, n = 3). Statistically significant differences {***/• < 0.001, **P < 0.01 and *P < 0.05; 
Student's r-test) between the wtAR and the AR-2KR cells after androgen exposure are indicated. 



ARBs, while 16 222 ARBs remained unchanged (Figure 4A 
and B). 

Similarly, ChlP-seq analysis of the wtAR and AR-2KR 
HEK293 cells revealed that 1473 AR-2KR-preferred ARBs 
and 806 wtAR-preferred ARBs, while 25 561 ARBs were 
insensitive to the mutation (Supplementary Figure S7A- 
C). Comparison of the wtAR PC-3 to the wtAR HEK293 
ChlP-seq data showed 45% overlap between the ARBs 
(Supplementary Figure S7D). Similarly, -40% of wtAR 
PC-3 ARBs overlapped with the ARBs of VCaP prostate 



cancer cells expressing endogenous AR (Supplementary 
Figure S7D), (31). The overall distribution of the ARBs in 
relation to gene structures was very similar in our HEK293 
and PC-3 cell models; most of the ARBs resided in introns 
or were intergenic, while only <14% resided within proxi- 
mal promoter regions (— 10 kb to +1 kb from transcription 
start sites, TSSs) (Figure 4C, Supplementary Figure S7E). 

IPA of genes in the PC-3 cells that had a preferred ARB 
within ±100 kb of their TSS showed that the AR-2KR- 
preferred ARBs were significantly associated with cellular 
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Figure 4. Genome-wide AR-binding events are influenced by AR 
SUMOylation sites. ClilP-seq analysis of AR chromatin interactions in the 
WtAR and the AR-2KR PC-3 cells after 1 h R1881 exposure. (A) Venn- 
diagram of AR binding events in wtAR- and AR-2KR-expressing cells 
(upper panel). Non-shared binding sites were analyzed further using get- 
DifferentialPeak tool with >2-fold difference as described in 'Materials 
and Methods' section to achieve final preferred binding sites. Heat map 
showing WtAR and AR-2KR ChlP-seq tag densities for wtAR- and AR- 
2KR-sliared and wtAR- or AR-2KR-preferred regions in a window ±2 
kb (bottom panel). (B) Comparison of the wtAR and the AR-2KR aver- 
age tag counts in ±500 bp from the centers of ARBs in three categories 
(shared/preferred). (C) Genomic distribution of the ARBs. 



development, cellular growth and proliferation and cellu- 
lar movement (Supplementary Figure S8A). Majority of 
the androgen-regulated genes both in the wtAR PC-3 cells 
(>77%) and in the AR-2KR PC-3 cells (>91%) harbored at 
least one ARB within ±100 kb of their TSS (Supplemen- 
tary Figure S8B, Supplementary file 3). However, on the 
genome-wide level, there was no clear difference in the as- 



sociation of the WtAR- and AR-2KR-preferred ARBs and 
the receptor type-preferred androgen target genes (Supple- 
mentary Figure S8B), indicating that the differences in the 
cistromes between the wtAR and the AR-2KR cells can- 
not alone explain the observed differences in their transcript 
levels. 

Although on the whole genome level the majority of the 
ARBs were insensitive to the AR SUMOylation sites, there 
were several androgen-regulated genes that harbored one 
or more wtAR- or AR-2KR-preferred ARBs (Figure 5). 
For example, CLDN8 that was androgen-regulated only in 
WtAR PC-3 cells did not show any clear ARB in the AR- 
2KR cells. Similarly, LRIGl was significantly androgen- 
regulated only in wtAR cells and it contained a wtAR- 
preferred ARB. In contrast, ADRAIB and TENM4 that 
showed a significantly stronger response to androgen in the 
AR-2KR cells possessed one or more AR-2KR-preferred 
ARBs. FKBP5, on the other hand, is an example of an AR 
target of which androgen regulation is insensitive to the AR 
SUMOylation sites and of which 16 ARBs out of 18 were 
shared ones. Examples of the wtAR- and the AR-2KR- 
preferred loci in the HEK293 cells are given in Supplemen- 
tary Figure S9. 

We next performed de novo motif analyses with all ARBs 
to identify over-represented transcription factor-binding 
motifs in the ARBs. As expected, the de novo ARE consen- 
sus (that closely resembles the ARE motif in the JASPAR- 
database) was the most enriched motif among all ARBs 
(in >60% of WtAR-, AR-2KR-preferred and shared ARBs) 
(Figure 6A and B). Further analysis revealed no clear dif- 
ferences in the number of AREs per ARE-containing ARBs 
between the preferred and shared ARBs, with the means 
(±SD) 1.28 ± 0.52, 1.20 ± 0.47 and 1.31 ± 0.56 for the 
WtAR-, the AR-2KR- and the shared ARBs, respectively. 
Other significantly enriched motifs were those for ERG, 
FOXAl, C/EBP and AP-1 (Figure 6A). While the de novo 
ARE and ERG motifs were fairly similarly enriched in 
all three ARB subsets, there were significant differences in 
the enrichment of FOXAl, C/EBP and AP-1 motifs to 
the three ARB subsets. Based on western blotting analy- 
sis, factors binding to these motifs (FOXAl, C/EBP(3 and 
c-Jun, major AP-1 component) are expressed in our PC3 
model cells (Supplementary Figure SIO). Both FOXAl and 
C/EBP motifs were found three times more often within the 
wtAR-preferred then AR-2KR-preferred ARBs, whereas 
the AP- 1 motif was nearly three times more prevalent within 
the AR-2KR-preferred ARBs (Figure 6B and C, Supple- 
mentary Figure Sll). As shown in Figure 6C, more pro- 
nounced loading of both the FOXAl and the C/EBP(3, 
for example, onto SAFB and ANKRD17 loci in wtAR 
than AR-2KR cells is in line with the in silico predictions 
in panel B. Moreover, c-Jun appears to be more avidly 
loaded, for example, onto ADRAIB locus in AR-2KR- 
than in wtAR-expressing PC3 cells (Figure 6C). De novo 
analysis of the HEK293 data showed, in addition to the 
AREs, a similar over-enrichment of FOXAl within the 
WtAR-preferred ARBs. GATAl was also over-enriched in 
the WtAR-preferred ARBs, whereas EBFl and PAX2 were 
more prevalent in the AR-2KR preferred ARBs (Supple- 
mentary Figure S12A and B). The differences in the en- 
riched motifs between the PC-3 and the HEK293 cells likely 
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reflect cell-specific differences in the expression of their cog- 
nate transcription factors. Compared to LNCaP and VCaP 
prostate cancer cells in which about half of the identified 
ARBs have a FOXAl motif (31,32), the PC-3 ARBs and 
the HEK293 ARBs less frequently harbor a FOXAl mo- 
tif This may be due to the lower amount of FOXAl in the 
latter two cell fines (Supplementary Figure SIO). 

Taken together, our ChlP-seq analyses suggest that 
SUMOylation regulates AR's interaction with the chro- 
matin, leading to the observed differences in the AR 
cistromes in both PC3 and HEK293 cells. In addition, a 
number of transcription factor-binding motifs are differen- 
tially enriched within the wtAR- and AR-2KR-preferred 
binding locations. 

DISCUSSION 

A large number of transcription factors are known to be 
covalently modified by SUMOs, but the regulatory role of 



these modifications has only rarely been addressed in an 
unbiased genome-wide fashion (5, 15, 33, 34). Here, we 
have investigated the effect of AR SUMOylation sites on 
AR chromatin occupancy and endogenous target gene ex- 
pression in a genome-wide fashion using stably wtAR- or 
SUMOylation site mutant receptor-expressing PC-3 and 
HEK293 cells. Already our recent expression analyses of 
a few AR target genes in the HEK293 cells suggested that 
the effect (enhancing, repressive or neutral) of SUMOyla- 
tion on the AR activity is target gene -dependent (14). Our 
genome-wide analyses of androgen-regulated genes in the 
WtAR and the AR SUMOylation mutant cells indicate that 
the SUMOylation sites do not simply repress the AR activ- 
ity on all target genes, as the mutant also exhibited atten- 
uated transcriptional activity on a number of target genes. 
Moreover, not all AR target genes were influenced by the 
AR SUMOylation sites. The SUMOylation not merely af- 
fected the expression of androgen-induced genes, but it also 
influenced the ability of AR to repress its target genes. Our 
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Studies focused on the PC-3 prostate cancer cells, because 
the cell background better resembles the natural environ- 
ment of AR function. A reasonable number (~30%) of the 
androgen-regulated genes in our PC-3 wtAR model cells 
was the same as in VCaP prostate cancer cells expressing 
amplified levels of endogenous AR (a model of hormone- 
refractory prostate cancer) (GSE30316). 

Pathway enrichment analysis of the genes differentially 
regulated by the wtAR and the AR SUMOylation mu- 
tant revealed that they are significantly associated with 
molecular and cellular functions of cellular movement, de- 
velopment, cell death and survival, cell morphology, and 
cellular growth and proliferation. Many genes with anti- 
proliferative effects, such as RASDl and LRIGl (26,27), 
showed significantly stronger androgen-up-regulation by 
the WtAR. On the other hand, genes, such as EFEMPl, 
DUSPl, ADRAIB (28-30), that were more robustly up- 
regulated by the SUMOylation-deficient AR are known 
to be associated with promotion of cell proliferation. In- 
terestingly, the genes LRIGl, DUSPl and RASDl are all 



significantly up-regulated in prostate cancers compared to 
nonmalignant prostates (Supplementary Figure SI 3) (http: 
//www.oncomine.org) (35). 

SUMOylation also influenced the AR cistrome. Albeit 
the majority of ARBs (shared ARBs) were not affected by 
the AR SUMOylation mutation, our replicated genome- 
wide ChlP-seq analyses in PC-3 cells revealed that ~10% 
of all identified ARBs were preferred either by wtAR (1561 
ARBs) or the AR SUMOylation mutant (2675 ARBs). 
These preferred ARBs were similarly distributed to genomic 
elements as the shared ARBs; with the majority of the 
ARBs in all three groups residing in intergenic or intronic 
regions and only a very small proportion locating to the 
proximal promoter regions. These genomic distribution fig- 
ures are similar to those of the ARBs recently reported 
from other prostate cancer cell fines (31,32,36,37). More- 
over, ~40% of PC-3 WtAR ARBs identified in this study 
overlap with those recently revealed in the VCaP cells. 

General comparison of motif signatures within the gene 
repression- and activation-associated ARBs revealed re- 
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markably similar features and the presence of AREs in both 
cases, suggesting that the AR-binding AREs also often me- 
diate gene repression, not only activation, as recently seen 
by us and others for glucocorticoid target genes (15,38). 
Of the androgen-regulated genes in PC-3 cells, ~80% and 
~90% of the wtAR and AR-2KR targets, respectively, are 
associated with at least one ARB. However, only 59/473 of 
the WtAR target genes showed wtAR-preferred ARBs and 
52/143 of the AR-2KR target genes displayed AR-2KR- 
preferred ARBs, suggesting that the preferred ARBs do 
not alone explain the differences in the androgen-regulated 
transcriptomes between the wtAR and the AR-2KR cells. 
However, differences in the sampling points of RNA pro- 
filing (16 h) and ChlP-seq (1 h) and single time point data 
hamper direct comparison and correlation of the gene ex- 
pression with the chromatin binding data. SUMOylation 
influences the AR's intranuclear mobility (based on FRAP 
assays) and this is also likely to be reflected on the AR's 
on-off kinetics in the chromatin (14). Moreover, different 
AR target genes display largely different mRNA induc- 
tion kinetics. Nevertheless, already based on our single time 
point data, several AR target loci differentially regulated 
due to the AR SUMOylation sites, such as CLDN8, LRIGl, 
ADRAIB and TENM4, showed differences in their pre- 
ferred ARBs and the level of ARB occupancy, which were in 
line with their androgen-regulated expression between the 
WtAR and the AR-2KR cells. 

Interestingly, the PC-3 cells expressing SUMOylation 
mutant AR proliferated faster and were more sensitive to 
apoptosis than the wtAR-expressing cells. A similar differ- 
ence was also detected with the HEK293 cells. These dif- 
ferences are perfectly in line with the altered androgen reg- 
ulation of several genes influencing cell growth. Fittingly, 
SUMOylation modulates the target gene selection of pro- 
gesterone receptor (PR) as well as glucocorticoid receptor 
(GR) such that their SUMO sensitive targets include genes 
associated with cell growth and proHferation (15,34). This is 
also the case with SUMOylation-defective microphthalmia- 
associated transcription factor (MITF) mutant associated 
with the predisposition to melanoma and renal carcinoma 
(39). As with both the AR and the GR, the MITF's 
chromatin occupancy is also influenced by the SUMOy- 
lation site mutation (39). Interestingly, however, our GR- 
expressing HEK293 model cells responded to the mu- 
tation of GR SUMOylation sites in a different fashion 
than our model AR cells, as the GR SUMOylation mu- 
tant cells displayed both more GR target genes and GR 
chromatin-binding sites (15). Also the attenuated abihty of 
SUMOylation-deficient GATA-1 to induce genes control- 
ling hematopoiesis has been linked to altered chromatin oc- 
cupancy (40). Furthermore, the disruption of orphan nu- 
clear receptor SF-1 (steroidogenic factor 1) SUMOylation 
in mice has intriguingly been linked to abnormal Hedgehog 
signahng and endocrine development (41). These data col- 
lectively suggest a wider role for SUMOylation in the regu- 
lation of target selection by sequence-specific transcription 
factors. 

Our binding motif analyses indicate that the classic ARE 
is similarly overrepresented among the wtAR- and AR- 
2KR-shared, as well as the wtAR-preferred and the AR- 
2KR-preferred ARBs. However, there were significant dif- 



ferences in the enrichment of other transcription factor- 
binding motifs between the wtAR-preferred and the AR- 
SUMOylation mutant-preferred ARBs: motifs for pioneer 
transcription factor FOXAl and C/EBP occurred more fre- 
quently among the wtAR-preferred ARBs, whereas AP-1 
motif was more prevalent among the AR-2KR preferred 
ARBs. The motif analyses thus imply that SUMOylation 
may influence the AR chromatin occupancy and target gene 
selection via regulating interactions with other sequence- 
specific transcription factors, such as FOXAl and AP-1, 
which may enhance or weaken the AR binding to the chro- 
matin. In conclusion, our objective genome-wide analyses 
reveal that SUMOylation does not simply repress the AR 
activity, but the modification regulates the receptor's target 
gene selection and plays an important role in controlling the 
proliferative effects of androgens. 
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